Search CORE

117 research outputs found

Wavelet Integrated CNNs for Noise-Robust Image Classification

Author: Guo Sheng
Lai Zhihui
Li Qiufu
Shen Linlin
Publication venue
Publication date: 14/07/2020
Field of study

Convolutional Neural Networks (CNNs) are generally prone to noise interruptions, i.e., small image noise can cause drastic changes in the output. To suppress the noise effect to the final predication, we enhance CNNs by replacing max-pooling, strided-convolution, and average-pooling with Discrete Wavelet Transform (DWT). We present general DWT and Inverse DWT (IDWT) layers applicable to various wavelets like Haar, Daubechies, and Cohen, etc., and design wavelet integrated CNNs (WaveCNets) using these layers for image classification. In WaveCNets, feature maps are decomposed into the low-frequency and high-frequency components during the down-sampling. The low-frequency component stores main information including the basic object structures, which is transmitted into the subsequent layers to extract robust high-level features. The high-frequency components, containing most of the data noise, are dropped during inference to improve the noise-robustness of the WaveCNets. Our experimental results on ImageNet and ImageNet-C (the noisy version of ImageNet) show that WaveCNets, the wavelet integrated versions of VGG, ResNets, and DenseNet, achieve higher accuracy and better noise-robustness than their vanilla versions.Comment: CVPR accepted pape

arXiv.org e-Print Archive

Crossref

Computational Technologies for Fashion Recommendation: A Survey

Author: Chua Tat-Seng
Ding Yujuan
Lai Zhihui
Mok P. Y.
Publication venue
Publication date: 06/06/2023
Field of study

Fashion recommendation is a key research field in computational fashion research and has attracted considerable interest in the computer vision, multimedia, and information retrieval communities in recent years. Due to the great demand for applications, various fashion recommendation tasks, such as personalized fashion product recommendation, complementary (mix-and-match) recommendation, and outfit recommendation, have been posed and explored in the literature. The continuing research attention and advances impel us to look back and in-depth into the field for a better understanding. In this paper, we comprehensively review recent research efforts on fashion recommendation from a technological perspective. We first introduce fashion recommendation at a macro level and analyse its characteristics and differences with general recommendation tasks. We then clearly categorize different fashion recommendation efforts into several sub-tasks and focus on each sub-task in terms of its problem formulation, research focus, state-of-the-art methods, and limitations. We also summarize the datasets proposed in the literature for use in fashion recommendation studies to give readers a brief illustration. Finally, we discuss several promising directions for future research in this field. Overall, this survey systematically reviews the development of fashion recommendation research. It also discusses the current limitations and gaps between academic research and the real needs of the fashion industry. In the process, we offer a deep insight into how the fashion industry could benefit from fashion recommendation technologies. the computational technologies of fashion recommendation

arXiv.org e-Print Archive

Energy harvesting from a DE-based dynamic vibro-impact system

Author: Gu Gunmo
Lai Zhihui
Thomson Gordon
Val Dimitry V.
Yurchenko Daniil
Publication venue: 'IOP Publishing'
Publication date: 01/10/2017
Field of study

Heriot Watt Pure

Adaptive Locality Preserving Regression

Author: Chen Runze
Fei Lunke
Lai Zhihui
Wen Jie
Zhang Zheng
Zhong Zuofeng
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2018
Field of study

This paper proposes a novel discriminative regression method, called adaptive locality preserving regression (ALPR) for classification. In particular, ALPR aims to learn a more flexible and discriminative projection that not only preserves the intrinsic structure of data, but also possesses the properties of feature selection and interpretability. To this end, we introduce a target learning technique to adaptively learn a more discriminative and flexible target matrix rather than the pre-defined strict zero-one label matrix for regression. Then a locality preserving constraint regularized by the adaptive learned weights is further introduced to guide the projection learning, which is beneficial to learn a more discriminative projection and avoid overfitting. Moreover, we replace the conventional `Frobenius norm' with the special l21 norm to constrain the projection, which enables the method to adaptively select the most important features from the original high-dimensional data for feature extraction. In this way, the negative influence of the redundant features and noises residing in the original data can be greatly eliminated. Besides, the proposed method has good interpretability for features owing to the row-sparsity property of the l21 norm. Extensive experiments conducted on the synthetic database with manifold structure and many real-world databases prove the effectiveness of the proposed method.Comment: The paper has been accepted by IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), and the code can be available at https://drive.google.com/file/d/1iNzONkRByIaUhXwdEhOkkh_0d2AAXNE8/vie

arXiv.org e-Print Archive

University of Queensland eSpace

Discriminative Elastic-Net Regularized Linear Regression

Author: Lai Zhihui
Shao Ling
Wu Jian
Xie Guo-Sen
Xu Yong
Zhang Zheng
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2017
Field of study

In this paper, we aim at learning compact and discriminative linear regression models. Linear regression has been widely used in different problems. However, most of the existing linear regression methods exploit the conventional zeroone matrix as the regression targets, which greatly narrows the flexibility of the regression model. Another major limitation of theses methods is that the learned projection matrix fails to precisely project the image features to the target space due to their weak discriminative capability. To this end, we present an elastic-net regularized linear regression (ENLR) framework, and develop two robust linear regression models which possess the following special characteristics. First, our methods exploit two particular strategies to enlarge the margins of different classes by relaxing the strict binary targets into a more feasible variable matrix. Second, a robust elastic-net regularization of singular values is introduced to enhance the compactness and effectiveness of the learned projection matrix. Third, the resulting optimization problem of ENLR has a closed-form solution in each iteration, which can be solved efficiently. Finally, rather than directly exploiting the projection matrix for recognition, our methods employ the transformed features as the new discriminate representations to make final image classification. Compared with the traditional linear regression model and some of its variants, our method is much more accurate in image classification. Extensive experiments conducted on publicly available datasets well demonstrate that the proposed framework can outperform the state-of-the-art methods. The MATLAB codes of our methods can be available at http://www.yongxu.org/lunwen.html

The Hong Kong Polytechnic University Pao Yue-kong Library

Crossref

PolyU Institutional Repository

University of East Anglia digital repository

Precise Facial Landmark Detection by Reference Heatmap Transformer

Author: Lai Zhihui
Liu Jun
Min Wenwen
Shen Linlin
Sun Hang
Wan Jun
Xiong Ping
Zhou Jie
Publication venue
Publication date: 14/03/2023
Field of study

Most facial landmark detection methods predict landmarks by mapping the input facial appearance features to landmark heatmaps and have achieved promising results. However, when the face image is suffering from large poses, heavy occlusions and complicated illuminations, they cannot learn discriminative feature representations and effective facial shape constraints, nor can they accurately predict the value of each element in the landmark heatmap, limiting their detection accuracy. To address this problem, we propose a novel Reference Heatmap Transformer (RHT) by introducing reference heatmap information for more precise facial landmark detection. The proposed RHT consists of a Soft Transformation Module (STM) and a Hard Transformation Module (HTM), which can cooperate with each other to encourage the accurate transformation of the reference heatmap information and facial shape constraints. Then, a Multi-Scale Feature Fusion Module (MSFFM) is proposed to fuse the transformed heatmap features and the semantic features learned from the original face images to enhance feature representations for producing more accurate target heatmaps. To the best of our knowledge, this is the first study to explore how to enhance facial landmark detection by transforming the reference heatmap information. The experimental results from challenging benchmark datasets demonstrate that our proposed method outperforms the state-of-the-art methods in the literature.Comment: Accepted by IEEE Transactions on Image Processing, March 202

arXiv.org e-Print Archive

Case Report: Step-by-step procedures for total intracorporeal laparoscopic kidney autotransplantation in a patient with distal high-risk upper tract urothelial carcinoma

Author: Caiyong Lai
Caiyong Lai
Caiyong Lai
Dongming Ye
Guohao Wu
Haomin Li
Huilan Luo
Junqiang Li
Lishan Xie
Mubiao Chen
Zhihui Chen
Publication venue: 'Frontiers Media SA'
Publication date: 01/04/2023
Field of study

A 47-year-old man presented to the emergency department with right abdominal pain and a new onset of painless haematuria two weeks earlier. Urine cytology test results suggested urothelial carcinoma. Computed tomography urography (CTU) showed a filling defect in the lower right ureter with right hydronephrosis. Lymphadenopathy and any signs of metastatic disease were absent on CTU. Cystoscopy appeared normal. Creatinine level was also normal before surgery. After the treatment options were discussed, the patient chose to undergo 3D total intracorporeal laparoscopic kidney autotransplantation, bladder cuff excision, and segmental resection of the proximal two-thirds of the ureter based on the membrane anatomy concept. After more than one year of follow-up, the patient was in good health and showed no signs of haematuria. Surveillance cystoscopy and CTU examination showed no evidence of disease recurrence. Therefore, it is reasonable to assume that kidney-sparing surgery may be considered for carefully selected patients with high-grade upper tract urothelial carcinoma

Directory of Open Access Journals